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I.  Introduction 


We  consider  here  the  following  problem:  a  student,  with  a  limited 
time  budget,  must  study  for  an  examination.  The  examination  will  consist 
of  several  questions,  one  from  each  of  several  fields.  The  student  will 
be  successful  (pass  the  exam)  if  he  answers  a  majority  of  the  questions 
correctly.  The  problem  is  to  decide  how  much  time  to  spend  on  each  of 
the  several  fields. 

Mathematically,  we  assume  the  subject  is  divided  into  n  fields. 

For  i=l,  ...,  n  we  assume  a  function 

Pi  *  fi(xi) 

gives  the  probability  that  the  question  on  the  i—  field  will  be 
correctly  answered  if  the  student  spends  x..  units  of  time  on  that 
field.  For  obvious  reasons,  we  shall  assume  each  f^  is  monotone 
non-decreasing  and  continuous,  and  bounded  below  by  0  and  above  by  1. 

Let  N  =  {l,2,...,n}  be  the  set  of  all  questions.  If  the  student 
has  probability  p.  of  answering  question  i  correctly,  and  if  all 
these  probabilities  are  independent,  then,  for  given  S  C  N, 

O)  Pc(Pi,  •••*  Pj  s  n  p,  n  (1-p.) 

*  1  n  ieS  VS  1 

is  the  probability  that  the  student  answers  all  the  questions  in  fields 
ieS,  and  none  of  the  others,  correctly. 

Let  m,  now,  be  the  required  number  of  correct  answers.  If  so,  then 
the  student's  probability  of  passing  the  test  is 
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where  the  summation  is  taken  over  all  sets  S  with  at  least  m  elements. 

The  student's  problem,  is,  then,  to  maximize  expression  (2)  subject 
to  (1)  and  the  budget  constraint 


Sxi  <  a 


xi  >  0 


i  *  1,  ....  n 


where  a  is  the  student's  available  time.  The  first  order  conditions 


for  this  problem  are 


3F  dp. 

Ip7  "  X  if  xi  >  0 

3F  dp. 

357  1X  1f  xi " 0 


In  the  general  case,  of  course,  this  presents  a  complicated 
computation.  We  will  consider  the  special  case  where 


P<(x) 


0  <  x  <  1 
x  >  1  . 


This  is  not  an  unreasonable  probability  function:  It  represents  the 
case  where  the  student  requires  unit  time  (the  time  can  of  course  be 
suitably  normalized)  to  read  each  section  of  the  textbook.  In  less  than 
unit  time,  he  can  only  read  a  proportional  fraction  of  the  section,  and 
the  probability  of  a  correct  answer  is  in  turn  proportional  to  that. 


In  this  case,  the  first-order  conditions  take  the  form 


=  X 

If 

>  X 

if 

<  X 

if 

0  <  <  1 
Pi  *  1 

P1  *0 


Now,  it  can  be  seen  that 


(8) 


3F 

3Pi 


z  n  p  n  (i-p .) 
s  j€S  J  j*s  J 
ieS  jj«i 
s=m 


where  the  sum  is  taken  over  all  sets  S,  containing  i  and  exactly  m-1 
other  elements.  We  shall  use  F^  to  denote  this  partial  derivative. 

We  prove,  now,  that  we  need  only  consider  points  (p1,...,Pn)  in 
which  each  p^  has  one  of  the  three  values  0,  1,  and  some  other  p. 

Lemma  1:  The  maximum  of  the  function  F  ,  subject  to  the  constraints 
(3)-(4),  Is  attained  at  a  point  (p1t...pn)  whose  components  have  only 
one  value  other  than  0  or  1; 

Proof:  Let  us  consider  the  expression  (8)  for  F^  .  Letting 
Mi,  we  can  write  this  as 

f  «  z  n  p  n  (i-p.)  +  z  n  p.  n  (i-pj 

1  S  j€S  3  j<S  3  S  jcS  3  j*S  3 

i€S  j/H  j/H 


where  the  first  sum  is  taken  over  all  S  with  ieS,  1<S,  s*m-l  , 
and  the  second  over  all  S  with  1,i<S,  s»m-l  .  We  rewrite  as 
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or  equivalently. 


(9)  Fi  •  Po  i  nn  n(i-p.)  +  (i-pj  i  n  n(i-p.) 

1  &  s  pj  2  1  s  pj  J 

where  the  first  sum  is  taken  over  all  S  with  i,£*S,  s=m-2  ,  and  the 
second  over  all  S  with  i,£*S  ,  sam-l  .  In  each  case  the  first 
product  is  over  all  j«S  ,  the  second  over  all  j  e  N-S-{i ,£}. 

We  have,  then, 

Fr  Fi  ■  ‘Pi  -  Pi>[f  "Pj  K'-Pj)  -  *  ”p4  "('-Pj)] 


where  the  two  sums  are  as  in  (9),  or  equivalently. 


Fi  •  Fi  ■  (Pi  -  pi>  Hii 


where 


s  n  p.  n  (1-p.)  -  e  n  p.  n  (l-p.) 
s  jcs  J  jts  2  s  jcs  J  j*s  J 
s=m-2  jj*l,  s*m-l  j?i. 


where  the  sums  in  (11)  are  over  all  subsets  SCN-{1,£}  with  m-2 
and  m-1  elements  respectively.  We  note,  inter  alia,  that  HiJt 
depends  on  p^,  j/i,£,  but  does  not  depend  on  p^  or  p^. 

Let  X  be  the  set  of  all  p*(p^,...,pn)  which  maximize  F 
subject  to  (3)-(4).  By  continuity  of  F  ,  X  will  be  compact  and 
non-empty.  Then  C( X)  ,  the  convex  hull  of  X  ,  is  compact  and  convex; 
moreover,  the  extreme  points  of  C(X)  are  all  points  of  X  (though  not 
all  points  of  X  are  necessarily  extreme  in  C(X) ) .  We  claim,  now,  that 


if  p*s(p-|*,...,Pn*)  is  extreme  in  C( X)  ,  the  components  Pj 
will  have  at  most  one  value  other  than  0  or  1. 

In  fact,  suppose  there  is  some  pair  of  indices  i,Jt,  such  that 

0  <  Pi*  <  p£  *  <  1. 

Since  peX  ,  then  by  (7-ii),  we  have 

F i ( P* )  *  Ft(p*) 

Now,  by  (10) 

frVOt-n'  Hu  <"*>• 

However,  Pj*  <  p*  ,  and  so  we  must  have  *  0. 

As  was  pointed  out  above,  however,  is  independent  of  both 

p^  and  p^  thus,  for  any  t  ,  the  point  p * ( t)  ,  given  by 

P1 * (t)  *  Pj*  ♦  t 

p  •  (t)  *  p  *  -  t 

p . * ( t)  =  p,*  for  all  other  j 
J  J 

will  also  have  h^(p')  =  0*  For  sufficiently  small  t  (both 
positive  and  negative)  p'(t)  will  satisfy  the  constraints  (3)-(4). 
Moreover,  the  directional  derivative  in  the  direction  of  increasing  t 
is  Fj-F^,  and  this  will  be  0  for  all  values  of  t.  Thus,  for 
sufficiently  small  t, 


%>V  *.  M."'  M  A. 


1 


F(p* (t) )  =  F(p* (— t) )  *  F(p*) . 

Since  p*  maximizes  F  ,  so  do  p * ( t )  and  p'(-t).  But  this  means 
both  p'(t)  and  p'(-t)  belong  to  X  ,  and,  since 


P*  =  \  (p'(t)  +  p'(-t)) 


we  conclude  that  p*  is  not  extreme  in  C(X)  .  This  contradiction 
proves  the  lemma. 

We  see,  then,  that  the  maximum  of  F  will  always  be  found  at  a  point 


of  the  form 


(  1  jCMl 

J  (  0  j  C  M, 


where  M-j,  M2,  Mg  are  disjoint  sets  whose  union  is  N  ,  with 
cardinalities  m-j,  m2,  and  mg  ,  while  0  <  p  <  1  .  We  have  then 


m<l  ^  ^  mj  s  n  • 

i  m-j  +  m2p  *  a. 

It  is  easy  to  see  that,  in  this  case,  we  will  have 


F  -  ?  ft)  pS  (1-p) 
sam-m-  v  / 


In  fact,  all  members  of  M1  are  always  correct,  and  all  members  of 
Mg  are  always  wrong.  Thus  the  student  will  pass  the  exam  if  and  only 
If  at  least  m-m,  of  the  members  of  M0  are  answered  correctly. 


■»  »  *  t 
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•l"  **."■  V  «*.  •. 


rUi£UUU:<j:kC>ilM'>iVlUtl.3U'>2>UliYi 


Lemma  2;  If  a>m,  then  F  is  maximized  by  setting  m-|>m.  If  eea. 
then  F  is  maximized  by  setting  m-|=0,  i.e.,  M^=0. 

Proof :  if  a  >_ m,  it  is  easy  to  see  that  F  can  be  made  equal  to  1 
simply  by  letting  m-|>m.  This  is  clearly  a  maximum. 

Suppose,  in  fact,  that  a<m,  but  M.|f0.  Then  m-|<ct<m,  so 
m2>0  as  otherwise  we  would  have  F=0  .  Let  i€Mp  then 

p.=l  and  0<Pj<1  ,  so  assuming  p  is  optimal,  we  must  have 


Fi  i  Fr 


Now,  however. 


(m0  \  ni"iTii  m-,  ^  fn«“ni 

(’  m0-l  \  m-m.-l  m.+n 

4-’)p  °'p) 


(since  as  we  saw  before,  F.  is  simply  the  probability  that  exactly  m-1 

J 

answers  other  than  j  be  correct).  Thus  we  have 

(m2  \  m-m.  m.+nu-m  ^  /  m2  ^  \  m-m.-l  m.+m9-m 
m-m,  )  p  1  (1-p)  1  2  i  ^m-mrV  p  1  (1-p)  1  2 


which  reduces  to 


mgP  ^  1 

m-m^  — 


By  (14),  however,  this  gives  us  a  >  m  which  is  a  contradiction.  Thus, 


a 


if  a  <  m,  then  at  the  optimum,  M-j  =  0  as  claimed.  Q.E.D. 

From  Lemma  2  we  see,  then,  that  in  the  "difficult"  case,  a  <  m,  we 


have  m^  =  0.  Denote  by  K,  then  =  N-K,  and  so  the  optimum 


will  be  obtained  at  a  point 


’j  '  { 


f  if  J£IC 


0  if  j  €  K 


where  K  has  k  elements.  In  this  case 


A  ft)  ft)  ’  ft?) 


and  we  look  for  the  value  of  k,  m  <_  k  <_  n,  which  maximizes  this  expression: 


<T<  n  s?n,  (9  (9  S  fc2) 


In  general,  we  can  obtain  this  number  from  tables  of  the  cumulative 


normal  distribution.  To  get  an  idea  of  its  behavior,  however,  we  let 


qkU> 


ft)  ft)  ’  ft?) 


be  the  probability  of  exactly  s  correct  answers,  assuming  that  the 
student  divided  his  time  among  k  sections.  Then 

qk(s)  k  (k-l)11'1  (k-c.)k‘s 

qk.,(s)  k-s  kk  (k-a-l)k‘s‘' 

As  a  +  0,  this  expression  approaches  the  limit 

09)  Lk(s>  ■  £  (f) 5  • 

Now,  it  is  easy  to  see  that,  for  k  >  0,  and  s  >  1, 


and  so  Lk( s )  >  1  for  s  >  1.  We  conclude  that,  for  small  values  of 
a,  qk(s)  >  qk_^(s)  for  a^  *  an<*  a^  s  >  It  and  so  k  should 
be  chosen  as  large  as  possible,  i.e.  k  =  n.  On  the  other  hand,  if  a 
is  large,  i.e.,  sufficiently  close  to  m,  we  know  it  is  best  to  choose 
k  =  m. 

We  conclude,  then,  that  for  small  a  the  student  should  study  some 
of  each  section;  for  large  a  (i.e.,  near  m)  he  should  concentrate  his 
studying  on  m  of  the  sections.  What  is  not  clear  is  (a)  whether  any 
intermediate  values  of  k  (i.e.,  m  <  k  <  n)  are  ever  optimal. 

To  look  at  this  problem  in  some  detail,  we  consider  the  case  n  *  13, 
m  ■  7.  Figure  1  shows  the  result  of  our  calculations:  k  *  13  is  optimal 
for  all  a  <  6.16,  while  k  *  7  is  optimal  for  o  >  6.30.  In  between 


u 


\  * 
& 


there  seem  to  be  five  small  subintervals  where  k  *  12,  11,  10,  9,  8  are 
successively  optimal. 

It  is  not  clear  whether  this  type  of  behavior  always  holds,  though  in 
the  several  cases  studied  by  the  authors  this  is  indeed  the  case.  If  we 
look  at  expression  (18),  we  note  that,  as  a  function  of  o,  these  ratios 
are  convex,  i.e.. 


8  /  qt(s) 

8?  lqk_,(s) 


<  0 


This  suggests  (though  it  does  not  prove)  that  this  type  of  behavior  will 
usually  hold. 

For  small  values  of  m,  it  is  not  difficult  to  show  that  this  is 
indeed  the  case.  For  example,  in  the  case  n*3,  m*2,  we  find  k=3  is 
optimal  for  a  <_  1.125,  with  ka2  optimal  if  a  >_  1.125. 

For  n=5,  ma3,  we  find  that  k*5  is  optimal  if  a  <_  2.117;  k=4  is 
optimal  for  2.117  <_  a  <_ 2.173;  finally,  k*3  will  be  optimal  if  a  > 

2.173. 

The  number  of  correct  answers— -assuming  all  study  was  concentrated 
on  k  sections  of  the  course— is  a  binomial  random  variable  with 
parameters  k  and  jj- ;  its  mean  Is  therefore  a,  and  its  variance  is 
a(  1  -jj-) .  For  large  values  of  m  and  n,  this  can  generally  be  approximated 
by  using  either  the  normal  or  the  Poisson  distribution. 

If  a  is  close  to  m,  say  a  »  m-X.  Then  letting  k»m,  we  would 


and  so  the  number  of  incorrect  answers  among  the  m  sections  studied  is 
a  binomial  variable  with  mean  X.  If  we  use  the  Poisson  approximation, 
the  probability  of  r  incorrect  answers  will  be 

-X,  r 

Qx(r)  =  V- 

In  particular,  the  probability  of  passing  the  exam  is  Q(0),  or  P-x. 

As  against  this,  if  the  student  studies  m+1  sections,  the  number  of 
incorrect  answers  among  the  sections  studied  will  also  be  approximately 
Poisson  with  mean  X  +  1.  To  pass,  at  most  one  can  be  incorrect;  the 
probability  of  passing  is  then 

Qx+i(0)  +  Qx+iO)  =  TX+1(1+X+1) 

and  this  will  be  greater  than  P”x  only  if  X>£-2,  i.e.,  if  a<M+2-£,,  or 
about  a<M-0.718.  Thus  k=M  is  optimal  if  a>M-0.718. 

Suppose,  on  the  other  hand,  a  is  considerably  smaller  than  M.  In 
this  case  concentration  on  k  sections  gives  us  a  binomial  variable 
which  can  best  be  approximated  by  a  normal  variable  with  mean  a  and 
variance  a(l-jr).  To  pass  the  examination,  the  student  requires  at  least 
m  correct  answers,  i.e.,  the  variable  must  have  a  value  at  least  equal  to 
m-^  (the  fractional  modification  is  standard  in  such  cases).  If  a, 
the  mean  of  the  variable,  is  more  than  slightly  below  m-j,  this  probability 


will  be  maximized  by  making  the  variance  as  large  as  possible.  With 


12 


a 


& 


a  fixed,  this  is  done  by  setting  k  as  large  as  possible,  i.e.,  k=n. 
The  probability  of  passing  the  exam  will  then  be  given  by 


P  -  * 


/  a-ifrg  \ 

\VaCl-a/n)/ 


where  $  is  the  cumulative  standard  normal  distribution  function. 

One  interesting  observation  remains  to  be  made,  and  it  concerns  the 
person  who  makes  up  the  exam.  If,  instead  of  asking  one  question  on  each 
section  of  the  course,  he  were  to  choose  n  questions  at  random 
(independently)  from  the  entire  subject  matter  of  the  course,  then  the 
student  who  devotes  a  units  of  time  (where  n  units  would  be 
required  to  know  the  entire  subject)  would  have  probability  a/n  on  each 
question.  In  effect,  this  is  the  same  as  if  the  student  had  devoted 
a/n  units  to  each  of  the  n  sections  of  the  course.  But  we  have  seen 
that  this  is  precisely  the  optimal  study  strategy  for  the  student  who 
spends  a  relatively  small  time  preparing  for  this  exam.  Thus,  such  a 
strategy  on  the  part  of  the  examiner  will  penalize  only  the  students  who 
spend  a  relatively  long  time  preparing,  i.e.,  the  conscientious 
students.  In  other  words,  the  student  who  knows,  e.g.,  80X  of  the  course 
material  will  get  a  grade  of  80X  if  there  is  one  question  from  each 
section,  but  might  fail  if  the  questions  are  chosen  randomly  from  the 
entire  course  matter.  The  student  who  knows  only  30X  of  the  course 
matter  has  the  same  probability  of  passing  under  either  mode  of 
examination. 
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Footnotes 


The  problem  we  consider  here  is  mathematically  analogous  to  problems 
considered  by  the  early  French  mathematician,  Condorcet.  For 
historical  background  and  parallels,  see  Grofman,  Owen  and  Feld, 
1982,  1983  (forthcoming). 
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